Raw Sequence Listing Error Summary

ERROR DETECTED        SUGGESTED CORRECTION

SERIAL NUMBER:



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE

Wrapped Nucleics
    The number/text at the end of each line "wrapped" down to the next line.
    This may occur if your file was retrieved in a word processor after creating it.
    Please adjust your right margin to .3, as this will prevent "wrapping".

Wrapped Aminos
    The amino acid number/text at the end of each line "wrapped" down to the next line.
    This may occur if your file was retrieved in a word processor after creating it.
    Please adjust your right margin to .3, as this will prevent "wrapping".



Incorrect Line Length
    The rules require that a line not exceed 72 characters in length. This includes spaces.

Misaligned Amino Acid Numbering
    The numbering under each 5th amino acid is misaligned. This may be caused by the use of tabs
    between the numbering. It is recommended to delete any tabs and use spacing between the numbers.
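
The two layout checks above (72-character limit, no tabs) can be tried on a file before submission. The following is only a sketch: the function name `find_layout_problems` and the sample text are made up for illustration, and this is not the checker the PTO itself runs.

```python
MAX_LINE_LENGTH = 72  # limit quoted in the "Incorrect Line Length" entry


def find_layout_problems(text, max_len=MAX_LINE_LENGTH):
    """Return (line_number, problem) tuples for over-long lines and tabs."""
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        # Spaces count toward the limit, so measure the raw line.
        if len(line) > max_len:
            problems.append(
                (number, f"line is {len(line)} characters, limit is {max_len}")
            )
        # Tabs between residue numbers are a likely cause of misalignment.
        if "\t" in line:
            problems.append((number, "line contains a tab character"))
    return problems


# Hypothetical sample: one over-long line and one line with a tab.
sample = "a" * 73 + "\nMet Ser\tAla\n<210> 1"
for number, problem in find_layout_problems(sample):
    print(number, problem)
```

Replacing each tab with spaces and re-saving the file as plain text would address both entries at once.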



Non-ASCII
    This file was not saved in ASCII (DOS) text, as required by the Sequence Rules.
    Please ensure your subsequent submission is saved in ASCII text so that it can be processed.

Variable Length
    Sequence(s) contain n's or Xaa's which represented more than one residue.
    As per the rules, each n or Xaa can only represent a single residue.
    Please present the maximum number of each residue having variable length and
    indicate in the (ix) feature section that some may be missing.
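
For the Non-ASCII entry, a file can be scanned for bytes outside the ASCII range before it is copied to diskette. This is a minimal sketch; the function name `find_non_ascii` and the sample bytes are illustrative assumptions.

```python
def find_non_ascii(data: bytes):
    """Return (line_number, column, byte_value) for each byte above 0x7F."""
    hits = []
    for line_number, line in enumerate(data.split(b"\n"), start=1):
        # Iterating over a bytes object yields integers in Python 3.
        for column, value in enumerate(line, start=1):
            if value > 0x7F:
                hits.append((line_number, column, value))
    return hits


# Hypothetical sample: a word-processor character left in the second line.
sample_bytes = b"<210> 1\nThermus\xe9"
print(find_non_ascii(sample_bytes))
```

An empty result means every byte is plain ASCII. It does not confirm that the file follows the rest of the Sequence Rules.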



PatentIn ver. 2.0 "bug"
    A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid
    sequence(s). Normally, PatentIn would automatically generate this section from the
    previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section
    to the subsequent amino acid sequence. This applies primarily to the mandatory <220>-<223>
    sections for Artificial or Unknown sequences.



Skipped Sequences (OLD RULES)
    Sequence(s) missing. If intentional, please use the following format for each skipped sequence:

    (2) INFORMATION FOR SEQ ID NO:X:
    (i) SEQUENCE CHARACTERISTICS: (Do not insert any headings under "SEQUENCE CHARACTERISTICS")
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:X:
    This sequence is intentionally skipped

    Please also adjust the "(iii) NUMBER OF SEQUENCES:" response to include the skipped sequence(s).



10 Skipped Sequences (NEW RULES)
    Sequence(s) missing. If intentional, please use the following format for each skipped sequence.

    <210> sequence id number
    <400> sequence id number
    000

Use of n's or Xaa's (NEW RULES)
    Use of n's and/or Xaa's has been detected in the Sequence Listing.
    Use of <220> to <223> is MANDATORY if n's or Xaa's are present.
    In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.
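
The n/Xaa rule above can be checked mechanically. The sketch below assumes the submitted file (without the line-number prefixes shown on the raw listing pages) starts each record with a `<210>` line and places the residues after the `<400>` line. The helper names and the three-record sample are hypothetical.

```python
FEATURE_TAGS = ("<220>", "<221>", "<222>", "<223>")


def split_records(listing):
    """Split a listing into {seq_id: record_text}, one record per <210> line."""
    records = {}
    current = None
    for line in listing.splitlines():
        stripped = line.strip()
        digits = ""
        if stripped.startswith("<210>"):
            digits = "".join(ch for ch in stripped[5:] if ch.isdigit())
        if digits:
            current = int(digits)
            records[current] = []
        if current is not None:
            records[current].append(stripped)
    return {seq_id: "\n".join(lines) for seq_id, lines in records.items()}


def uses_n_or_xaa(record):
    """True if the residues after <400> contain Xaa or a nucleotide n."""
    body = record.split("<400>", 1)[1] if "<400>" in record else ""
    for token in body.split():
        if token == "Xaa":
            return True
        # Nucleotide groups are all-lowercase letters; amino acids are not.
        if token.isalpha() and token.islower() and "n" in token:
            return True
    return False


def missing_features(listing):
    """Map each SEQ ID that uses n/Xaa to the <220>-<223> tags it lacks."""
    flagged = {}
    for seq_id, record in split_records(listing).items():
        if uses_n_or_xaa(record):
            missing = [tag for tag in FEATURE_TAGS if tag not in record]
            if missing:
                flagged[seq_id] = missing
    return flagged


# Hypothetical three-record listing, made up for illustration.
sample_listing = (
    "<210> 1\n<212> DNA\n<400> 1\nacgtnacgt 9\n"
    "<210> 2\n<212> PRT\n<220>\n<221> VARIANT\n<222> (2)\n"
    "<223> Xaa is any amino acid\n<400> 2\nMet Xaa Ala\n"
    "<210> 3\n<212> PRT\n<400> 3\nMet Asn Ala\n"
)
print(missing_features(sample_listing))
```

Only the first sample record is flagged: the second carries all four tags, and the third contains neither n nor Xaa.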



11 Use of <213> Organism (NEW RULES)
    Sequence(s) ____ are missing this mandatory field or its response.



12 Use of <220> Feature (NEW RULES)
    Sequence(s) are missing the <220> Feature and associated headings.
    Use of <220> to <223> is MANDATORY if <213> ORGANISM is "Artificial" or "Unknown".
    Please explain source of genetic material in <220> to <223> section.

    (See "Federal Register," 6/01/98, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of new Rules)
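
Items 11 and 12 can be combined into one per-record check. This is a sketch under assumptions: the record is plain text with one numeric identifier per line, the function name `check_organism` is invented here, and only the two endpoints of the <220> to <223> range are tested as a simplification.

```python
import re


def check_organism(record):
    """Return a list of problems with the <213> field of one record."""
    problems = []
    # Accept both "<213> ORGANISM: name" and "<213> name"; stay on one line.
    match = re.search(r"<213>[ \t]*(?:ORGANISM:)?[ \t]*(.*)", record)
    organism = match.group(1).strip() if match else ""
    if not organism:
        problems.append("<213> field or its response is missing")
    elif organism.lower().startswith(("artificial", "unknown")):
        for tag in ("<220>", "<223>"):
            if tag not in record:
                problems.append(f"{tag} not found although <213> is {organism!r}")
    return problems


# Hypothetical records, made up for illustration.
print(check_organism("<210> 1\n<213> Thermus thermophilus\n<400> 1\nacgt"))
print(check_organism("<210> 2\n<213>\n<400> 2\nacgt"))
print(check_organism("<210> 3\n<213> Artificial Sequence\n<400> 3\nacgt"))
```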
PatentIn ver. 2.0 "bug"
    Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted
    file, resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence listing).
    Instead, please use "File Manager" or any other means to copy file to floppy disk.

AKS - Biotechnology Systems Branch - 5/15/99



OIPE 



RAW SEQUENCE LISTING                DATE: 12/07/2000
PATENT APPLICATION: US/09/716,964   TIME: 07:50:03

Input Set : A:\R10301.app
Output Set: N:\CRF3\12072000\I716964.raw

Does Not Comply
Corrected Diskette Needed

   3 <110> APPLICANT: O'Donnell, Michael E.
   4       Yuzhakov, Alexander
   5       Yurieva, Olga
   6       Jeruzalmi, David
   7       Bruck, Irina
   8       Kuriyan, John
  10 <120> TITLE OF INVENTION: ENZYMES DERIVED FROM THERMOPHILIC ORGANISMS THAT
  11       FUNCTION AS A CHROMOSOMAL REPLICASE, PREPARATION AND
  12       USE THEREOF
  14 <130> FILE REFERENCE: 22221/1030
-> 16 <140> CURRENT APPLICATION NUMBER: US/09/716,964
-> 17 <141> CURRENT FILING DATE: 2000-11-21
  19 <150> PRIOR APPLICATION NUMBER: 60/143,202
  20 <151> PRIOR FILING DATE: 1997-04-08
  22 <150> PRIOR APPLICATION NUMBER: 08/823,407
  23 <151> PRIOR FILING DATE: 1997-04-08
  25 <150> PRIOR APPLICATION NUMBER: 09/057,416
  26 <151> PRIOR FILING DATE: 1998-04-08
  28 <160> NUMBER OF SEQ ID NOS: 212
  30 <170> SOFTWARE: PatentIn Ver. 2.1
  32 <210> SEQ ID NO: 1
  33 <211> LENGTH: 2007
  34 <212> TYPE: DNA
  35 <213> ORGANISM: Thermus thermophilus
  37 <400> SEQUENCE: 1

38 tccyggggtg gggttcccag gtagaccccg gcccctcccg tgagcccctt tacecaygcc 60
39 gccacctcct ccaggggggc caaygcgtgc aaggagagga acgtccgcac cacgccctat 120
40 actagccttg tgagcgccct ctaocgccgc ttccgccccc tcaocttcca ggaggtggtg 180
41 gggcaggagc acgtgaagga gcccctcotc aaggccatcc gggaggggag gctcgcccag 240
42 gcctacctct tctccgggcc caggggcgtg ggcaagacca ccacggcgag gctcctcgcc 300
43 atgycygtgg ygtgccaggy ygaagacccc ccttgcgggg tctgccccca ctgccaygcg 360
44 gtgcagaggg gcgcccaccc ggacgtggtg gacattgacg ccgccagcaa caactccgtg 420
45 gaggacgtgc gygagctyag ggaaaygale cacctcgccc cectctctgc ccccaygaag 480
46 gtctucatcc tggacgaggc ccacatgctc tccaaaagcg ccttcaacgc cctcctcaag 540
47 accctggagg agcccccgcc ccacgtcctc ttcgtcttcg ccaccaccga gcccgagagg 600
48 atgcccccca ccatcctctc ccgcacccag cacttccgct tccgccgcct cacggaggag 660
49 gagatcgcct ttaagctccg gcgcaticctg gaygccgtgg gycgggagyc ggaggaggag 720
50 gccctcctcc tcctcgcccg cctggcggac ggggccctta gggacgcgga aagcctcctg 780
51 gagcgcttcc tcctcctgga aggccccctc acccggaagg aggtggagcg cgccctaggc 840
52 tcccccccag ggaccggygt ggccgagauc gccgcctccc tcgcgaggyg gaaaacggcg 900
53 gaggccctgg gcctcgcccg gcgcctctac ggggaagggt acgccccgag gagcctggic 960
54 tcygyccttt Lyyagytytt ccgggaayyc ctctacgccg ccttcggcct cgcgyyaacc 1020
55 ccccttcccg ccccgcccca ggccctgatc gccgccatga ccgccctgga cgaggccatg 1080
56 gagcgcctcg cccgccgcLc cgacgcctta agcctggagg tggccctcct ggaggcggga 1140
57 ayygccctgg ccgccgaggc cctaccccag cccacggycg ctccttcccc agaggtcggc 1200
58 cccaagccgg aaagcccccc gaccccggaa cccccaaggc ccgayyayyc gcccgacctg 1260
59 cgggagcggt ggcgggcctt cctcgaggcc ctcaggccca ccctacgggc cttcgtgcgg 1320






60 guggcccycc cggaggLccg ggaaggccacj ctctgcctcg ctttccccga ggacaaggee 1380
61 ttccactacc gcaaggcctc ggaacagaag gtgaggctcc tceccctggc ccaggcccal 1440
62 ttcgqggtgg aggaggtcgt cctcgtcctg gagggagaaa aaaaaagect gayeccaagg 1500
63 ccecycccgg ccccacctcc tgaagcgccc gcaccccegg geccLcccga gqaggaggta 1560
64 yaqycgyagg aagcggcgga ggaggccccy gagguggect tgaggcgggt gyteegectc 1620
65 Ctyggggggc 99gtgctctg ggtgcggegg cocaggaccc gggaggegee ggaggaggaa 1680
66 cccctgaycc aagacgagat agggggtact gqtatataat gqggycatga cqcggaccac 1740
67 cgacctegga caagagaccg tggacaaeat ccteaayege ctccgccgta ttyagggeca 1800
68 ygtgcggggg etccagaaga tggtggccga gggccgccee tgegacgagg tcctcaccca 1860
69 gatgaccgcc accaagaagg ccatggaggc ggcggccacc ctgatcctcc acgagttcct 1920
70 gaacgtctgc gccgccgagg tctccgaggg caaggtgaac cccaagaagc ccyaggagat 1980
71 cgccaccatg ctgaagaaet tcatcta 2007


74 <210> SEQ ID NO: 2
75 <211> LENGTH: 529
76 <212> TYPE: PRT
77 <213> ORGANISM: Thermus thermophilus
79 <400> SEQUENCE: 2
80 Met Ser Ala Leu Tyr Arg Arg Phe Arg Pro Leu Thr Phe Gln Glu Val
81 1 5 10 15
83 Val Gly Gln Glu His Val Lys Glu Pro Leu Leu Lys Ala Ile Arg Glu
84 20 25 30
86 Gly Arg Leu Ala Gln Ala Tyr Leu Phe Ser Gly Pro Arg Gly Val Gly
87 35 40 45
89 Lys Thr Thr Thr Ala Arg Leu Leu Ala Met Ala Val Gly Cys Gln Gly
90 50 55 60
92 Glu Asp Pro Pro Cys Gly Val Cys Pro His Cys Gln Ala Val Gln Arg
93 65 70 75 80
95 Gly Ala His Pro Asp Val Val Asp Ile Asp Ala Ala Ser Asn Asn Ser
96 85 90 95
98 Val Glu Asp Val Arg Glu Leu Arg Glu Arg Ile His Leu Ala Pro Leu
99 100 105 110
101 Ser Ala Pro Arg Lys Val Phe Ile Leu Asp Glu Ala His Met Leu Ser
102 115 120 125
104 Lys Ser Ala Phe Asn Ala Leu Leu Lys Thr Leu Glu Glu Pro Pro Pro
105 130 135 140
107 His Val Leu Phe Val Phe Ala Thr Thr Glu Pro Glu Arg Met Pro Pro
108 145 150 155 160
110 Thr Ile Leu Ser Arg Thr Gln His Phe Arg Phe Arg Arg Leu Thr Glu
111 165 170 175
113 Glu Glu Ile Ala Phe Lys Leu Arg Arg Ile Leu Glu Ala Val Gly Arg
114 180 185 190
116 Glu Ala Glu Glu Glu Ala Leu Leu Leu Leu Ala Arg Leu Ala Asp Gly
117 195 200 205
119 Ala Leu Arg Asp Ala Glu Ser Leu Leu Glu Arg Phe Leu Leu Leu Glu
120 210 215 220
122 Gly Pro Leu Thr Arg Lys Glu Val Glu Arg Ala Leu Gly Ser Pro Pro
123 225 230 235 240
125 Gly Thr Gly Val Ala Glu Ile Ala Ala Ser Leu Ala Arg Gly Lys Thr
126 245 250 255

128 Ala Glu Ala Leu Gly Leu Ala Arg Arg Leu Tyr Gly Glu Gly Tyr Ala
129 260 265 270
131 Pro Arg Ser Leu Val Ser Gly Leu Leu Glu Val Phe Arg Glu Gly Leu
132 275 280 285
134 Tyr Ala Ala Phe Gly Leu Ala Gly Thr Pro Leu Pro Ala Pro Pro Gln
135 290 295 300
137 Ala Leu Ile Ala Ala Met Thr Ala Leu Asp Glu Ala Met Glu Arg Leu
138 305 310 315 320
140 Ala Arg Arg Ser Asp Ala Leu Ser Leu Gln Val Ala Leu Leu Glu Ala
141 325 330 335
143 Gly Arg Ala Leu Ala Ala Glu Ala Leu Pro Gln Pro Thr Gly Ala Pro
144 340 345 350
146 Ser Pro Glu Val Gly Pro Lys Pro Glu Ser Pro Pro Thr Pro Glu Pro
147 355 360 365
149 Pro Arg Pro Glu Glu Ala Pro Asp Leu Arg Glu Arg Trp Arg Ala Phe
150 370 375 380
152 Leu Glu Ala Leu Arg Pro Thr Leu Arg Ala Phe Val Arg Glu Ala Arg
153 385 390 395 400
155 Pro Glu Val Arg Glu Gly Gln Leu Cys Leu Ala Phe Pro Glu Asp Lys
156 405 410 415
158 Ala Phe His Tyr Arg Lys Ala Ser Glu Gln Lys Val Arg Leu Leu Pro
159 420 425 430
161 Leu Ala Gln Ala His Phe Gly Val Glu Glu Val Val Leu Val Leu Glu
162 435 440 445
164 Gly Glu Lys Lys Ser Leu Ser Pro Arg Pro Arg Pro Ala Pro Pro Pro
165 450 455 460
167 Glu Ala Pro Ala Pro Pro Gly Pro Pro Glu Glu Glu Val Glu Ala Glu
168 465 470 475 480
170 Glu Ala Ala Glu Glu Ala Pro Glu Glu Ala Leu Arg Arg Val Val Arg
171 485 490 495
173 Leu Leu Gly Gly Arg Val Leu Trp Val Arg Arg Pro Arg Thr Arg Glu
174 500 505 510
176 Ala Pro Glu Glu Glu Pro Leu Ser Gln Asp Glu Ile Gly Gly Thr Gly
177 515 520 525
179 Ile

183 <210> SEQ ID NO: 3
184 <211> LENGTH: 1590
185 <212> TYPE: DNA
186 <213> ORGANISM: Thermus thermophilus
188 <400> SEQUENCE: 3
189 gtqagcgccc tctaccgccg cttccgcccc ctcaccttcc aggaggtggt qgggcaqgag 60
190 cacgtgaagg agcccctcct caaggccatc cgggagggga ggctcgccca ggcctacctc 120
191 ttctccgggc ccaggggcgt gggcaagacc accacggcqa ggctcctegc catggcggtg 180
192 gggtgccagg gggaagaccc cccttgcggg gtctgcceoe actqccagyc ggtgeagagg 240
193 ggcgcccacc cggacgtggt ggacattgac gccgccagca acaactccgt ggaggacgtg 300
194 cgggaqctga gggaaaggat ccacctcgcc cccctctctg cccccaggaa ggtcttcatc 360
195 ctggacgagg cccacatgct etccaaaagc gccttcaacg ccctcctcaa gaccctggag 420
196 gagcocccgc cccacgtcct cttcgtcLtc gccaccaccg agcccgagag gatgcccccc 480
197 accatccLct cccgcaccca gcactbccgc Ltccgccgcc tcacggagga ggagatcgcc 540






198 tttaagetcc ggcgcatcct ggagyccgtg gggcgyyagg cggaggagga gyccctcctc 600
199 ctectegccc gectygcyga cggggccctt agggacgcgg aaagcctcct ggagcyctto 660
200 ctcctcctgg auggceccct cacccyqaay yayytyyayc gcgccctagq ctccccccca 720
201 gggaccyggg tygecgagat cgccgcctcc ctcgcgagyg qgaaaacggc yyaqycccty 780
202 ggectcgccc ggcycctcta cgyggaaggg tacgccccga ygagcctgyt ctcygycctt 840
203 ttggaggtgt tccgggaagg cctctacgcc gccttcygcc tcgcgggaac cccccttccc 900
204 gccccgcccc aygocctgat cgccgccatg accgccctgy acgaggccat qgagcgcctc 960
205 gcccyccget ccgacgcctt aagcctygay ytggccctcc tygayycygy aagggcccty 1020
206 gccgccgagg ccctacccca gcccacgggc gctccttccc cagagytcgg ccccaagccg 1080
207 gaaagccccc cgacccoyga acccccaagg eccgaggagg cgcccgacct gcgggagcgg 1140
208 tggeggycct tcctcgaggc cctcagycec accctacggy ccttcgtgcg gyayycecgc 1200
209 ccggaggtcc gggaagqcca gctctgcctc gctttccccq aygacaaygc cttccactac 1260
210 cgcaaggcct cggaacagaa ggtgaggctc ctccccctgg cccaggccca tttcgyygtg 1320
211 gaggaggtcg tcctcgtoct gyagggaqaa aaaaaaagcc tgagcccaag gccccycccg 1380
212 gccccacctc ctyaagcgcc cgcacccccy yycectccey aygayyaygt agaggcgyay 1440
213 qaagcggcqg aggaggcccc ggaggaggcc ttgaqgcggg tygtccgcct cctggggygy 1500
214 cgygtyctct qggtgcgycg ycccayyaec cygyayycyc cygagyayga acccctgagc 1560
215 caagacgaga tagggggtac tygtatataa 1590
218 <210> SEQ ID NO: 4
219 <211> LENGTH: 464
220 <212> TYPE: PRT
221 <213> ORGANISM: Thermus thermophilus
223 <400> SEQUENCE: 4
224 Met Ser Ala Leu Tyr Arg Arg Phe Arg Pro Leu Thr Phe Glu Glu Val
225 1 5 10 15
227 [...] Glu His Val Lys Glu Pro Leu Leu Lys Ala Ile Arg Glu
228 20 25 30
230 Gly Arg Leu Ala Gln Ala Tyr Leu Phe Ser Gly Pro Arg Gly Val Gly
231 35 40 45
233 Lys Thr Thr Thr Ala Arg Leu Leu Ala Met Ala Val Gly Cys Gln Gly
234 50 55 60
236 [...] Pro Cys Gly Val Cys Pro His Cys Gln Ala Val Gln Arg
237 65 70 75 80
239 [...] Pro Asp Val Val Asp Ile Asp Ala Ala Ser Asn Asn Ser
240 85 90 95
242 [...] Val Arg Glu Leu Arg Glu Arg Ile His Leu Ala Pro Leu
243 100 105 110
245 [...] Arg Lys Val Phe Ile Leu Asp Glu Ala His Met Leu Ser
246 115 120 125
248 Lys Ser Ala Phe Asn Ala Leu Leu Lys Thr Leu Glu Glu Pro Pro Pro
249 130 135 140
251 His Val Leu Phe Val Phe Ala Thr Thr Glu Pro Glu Arg Met Pro Pro
252 145 150 155 160
254 Thr Ile Leu Ser Arg Thr Gln His Phe Arg Phe Arg Arg Leu Thr Glu
255 165 170 175
257 Glu Glu Ile Ala Phe Lys Leu Arg Arg Ile Leu Glu Ala Val Gly Arg
258 180 185 190
260 Glu Ala Glu Glu Glu Ala Leu Leu Leu Leu Ala Arg Leu Ala Asp Gly
261 195 200 205

263 Ala Leu Arg Asp Ala Glu Ser Leu Leu Glu Arg Phe Leu Leu Leu Glu
264 210 215 220
266 Gly Pro Leu Thr Arg Lys Glu Val Glu Arg Ala Leu Gly Ser Pro Pro
267 225 230 235 240
269 Gly Thr Gly Val Ala Glu Ile Ala Ala Ser Leu Ala Arg Gly Lys Thr
270 245 250 255
272 Ala Glu Ala Leu Gly Leu Ala Arg Arg Leu Tyr Gly Glu Gly Tyr Ala
273 260 265 270
275 Pro Arg Ser Leu Val Ser Gly Leu Leu Glu Val Phe Arg Glu Gly Leu
276 275 280 285
278 Tyr Ala Ala Phe Gly Leu Ala Gly Thr Pro Leu Pro Ala Pro Pro Gln
279 290 295 300
281 Ala Leu Ile Ala Ala Met Thr Ala Leu Asp Glu Ala Met Glu Arg Leu
282 305 310 315 320
284 Ala Arg Arg Ser Asp Ala Leu Ser Leu Glu Val Ala Leu Leu Glu Ala
285 325 330 335
287 Gly Arg Ala Leu Ala Ala Glu Ala Leu Pro Gln Pro Thr Gly Ala Pro
288 340 345 350
290 Ser Pro Glu Val Gly Pro Lys Pro Glu Ser Pro Pro Thr Pro Glu Pro
291 355 360 365
293 Pro Arg Pro Glu Glu Ala Pro Asp Leu Arg Glu Arg Trp Arg Ala Phe
294 370 375 380
296 Leu Glu Ala Leu Arg Pro Thr Leu Arg Ala Phe Val Arg Glu Ala Arg
297 385 390 395 400
299 Pro Glu Val Arg Glu Gly Gln Leu Cys Leu Ala Phe Pro Glu Asp Lys
300 405 410 415
302 Ala Phe His Tyr Arg Lys Ala Ser Glu Gln Lys Val Arg Leu Leu Pro
303 420 425 430
305 Leu Ala Gln Ala His Phe Gly Val Glu Glu Val Val Leu Val Leu Glu
306 435 440 445
308 Gly Glu Lys Lys Lys Pro Glu Pro Lys Ala Pro Pro Gly Pro Thr Ser
309 450 455 460
315 <210> SEQ ID NO: 5
316 <211> LENGTH: 4[?]4
317 <212> TYPE: PRT
318 <213> ORGANISM: Thermus thermophilus
320 <400> SEQUENCE: 5
321 Met Ser Ala Leu Tyr Arg Arg Phe Arg Pro Leu Thr Phe Gln Glu Val
322 1 5 10 15
324 Val Gly Gln Glu His Val Lys Glu Pro Leu Leu Lys Ala Ile Arg Glu
325 20 25 30
327 Gly Arg Leu Ala Gln Ala Tyr Leu Phe Ser Gly Pro Arg Gly Val Gly
328 35 40 45
330 Lys Thr Thr Thr Ala Arg Leu Leu Ala Met Ala Val Gly Cys Gln Gly
331 50 55 60
333 Glu Asp Pro Pro Cys Gly Val Cys Pro His Cys Gln Ala Val Gln Arg
334 65 70 75 80
336 Gly Ala His Pro Asp Val Val Asp Ile Asp Ala Ala Ser Asn Asn Ser
337 85 90 95
<210> 89
<211> 182
<212> PRT
<213> Deinococcus radiodurans

<400> 89
Pro Trp Pro Gln Asp Val Val Val Phe Asp Leu Glu Thr Thr Gly Phe
1 5 10 15
Ser Pro Ala Ser Ala Ala Ile Val Glu Ile Gly Ala Val Arg Ile Val
20 25 30
Gly Gly Gln Ile Asp Glu Thr Leu Lys Phe Glu Thr Leu Val Arg Pro
35 40 45
Thr Arg Pro Asp Gly Ser Met Leu Ser Ile Pro Trp Gln Ala Gln Arg
50 55 60
Val His Gly Ile Ser Asp Glu Met Val Arg Arg Ala Pro Ala Xaa Lys
65 70 75 80
Asp Val Leu Pro Asp Phe Phe Asp Phe Val Asp Gly Ser Ala Val Val
85 90 95
Ala His Asn Val Ser Phe Asp Gly Gly Phe Met Arg Ala Gly Ala Glu
100 105 110
Arg Leu Gly Leu Ser Trp Ala Pro Glu Arg Glu Leu Cys Thr Met Gln
115 120 125
Leu Ser Arg Arg Ala Phe Pro Arg Glu Arg Thr His Asn Leu Thr Val
130 135 140
Leu Ala Glu Arg Leu Gly Leu Glu Phe Ala Pro Gly Gly Arg His Arg
145 150 155 160
Ser Tyr Gly Asp Val Gln Val Thr Ala Gln Ala Tyr Leu Arg Leu Leu
165 170 175
Glu Leu Leu Gly Glu Arg
180


Please Note: 

Use of n and/or Xaa has been detected in the Sequence Listing. Please review the
Sequence Listing to ensure that a corresponding explanation is presented in the <220> to 
<223> fields of each sequence which presents at least one n or Xaa. 
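
When writing the <220> to <223> explanation, it helps to know exactly where each Xaa sits. The sketch below counts residues in one printed row; the function name `xaa_positions` is an illustrative assumption, and the row is residues 65 to 80 of the SEQ ID NO: 89 listing above.

```python
def xaa_positions(residues, first_position=1):
    """Return the sequence positions of every Xaa in a row of residues."""
    tokens = residues.split()
    return [
        first_position + index
        for index, token in enumerate(tokens)
        if token == "Xaa"
    ]


# Residues 65 to 80 of SEQ ID NO: 89; the row starts at position 65.
row = "Val His Gly Ile Ser Asp Glu Met Val Arg Arg Ala Pro Ala Xaa Lys"
print(xaa_positions(row, first_position=65))
```

For this row the only Xaa falls at position 79, which is the location a feature entry for SEQ ID NO: 89 would need to describe.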



VERIFICATION SUMMARY                DATE: 12/07/2000
PATENT APPLICATION: US/09/716,964   TIME: 07:50:04

Input Set : A:\R10301.app
Output Set: N:\CRF3\12072000\I716964.raw

16 M:270 C: Current Application Number differs, Replaced Application Number
17 M:271 C: Current Filing Date differs, Replaced Current Filing Date
552 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:17
552 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:17
552 M:340 W: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:17
1062 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:29
1062 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:29
1062 M:340 W: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:29
1074 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:30
1074 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:30
1074 M:340 W: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:30
1224 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:42
1242 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:43
1506 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:66
1506 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:66
1506 M:340 W: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:66
1519 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:67
1519 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:67
1519 M:340 W: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:67
1532 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:68
1532 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:68
1532 M:340 W: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:68
2169 M:258 W: Mandatory Feature missing, <220> not found for SEQ ID#:89
2169 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:89
2169 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:89
2169 M:258 W: Mandatory Feature missing, <223> not found for SEQ ID#:89
2169 M:340 W: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:89
2252 M:258 W: Mandatory Feature missing, <220> not found for SEQ ID#:91
2252 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:91
2252 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:91
2252 M:258 W: Mandatory Feature missing, <223> not found for SEQ ID#:91
2252 M:340 W: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:91
2255 M:258 W: Mandatory Feature missing, <220> not found for SEQ ID#:91
2255 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:91
2255 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:91
2255 M:258 W: Mandatory Feature missing, <223> not found for SEQ ID#:91
M:340 Repeated in SeqNo=91
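
A summary like the one above is easier to act on when grouped by sequence. The sketch below assumes each entry sits on one line in the form `line M:code severity: message` and that the message names the sequence as `SEQ ID#:n`. The function name `group_by_sequence` and the three sample lines are illustrative only.

```python
import re
from collections import defaultdict

# One entry: listing line number, message code, severity letter, message text.
ENTRY = re.compile(r"^\s*(\d+)\s+M:(\d+)\s+([A-Z]):\s*(.*)$")
SEQ_ID = re.compile(r"SEQ ID#:\s*(\d+)")


def group_by_sequence(summary):
    """Group summary entries by SEQ ID; entries naming no sequence go under None."""
    grouped = defaultdict(list)
    for line in summary.splitlines():
        entry = ENTRY.match(line)
        if not entry:
            continue  # skip headers, blank lines, and unparsed text
        line_no, code, severity, message = entry.groups()
        seq = SEQ_ID.search(message)
        key = int(seq.group(1)) if seq else None
        grouped[key].append((int(line_no), int(code), severity, message))
    return dict(grouped)


# Three sample entries in the assumed one-line form.
sample_summary = (
    "16 M:270 C: Current Application Number differs, Replaced Application Number\n"
    "552 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:17\n"
    '552 M:340 W: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:17\n'
)
for seq_id, entries in group_by_sequence(sample_summary).items():
    print(seq_id, [code for _, code, _, _ in entries])
```

Grouping this way gives one worklist per sequence, so the missing <220> to <223> fields for a given SEQ ID can be added in a single pass.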